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We present a search for the standard model Higgs boson produced with a Z boson in 4.1 fb _1 of 
data collected with the CDF II detector at the Tevatron. In events consistent with the decay of the 
Higgs boson to a bottom-quark pair and the Z boson to electrons or muons, we set 95% credibility 
level upper limits on the ZH production cross section times the H — > bb branching ratio. Improved 
analysis methods enhance signal sensitivity by 20% relative to previous searches beyond the gain 
due to the larger data sample. At a Higgs boson mass of 115 GeV/c 2 we set a limit of 5.9 times the 
standard model value. 
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mediated by a Higgs field that manifests a particle, the 
as-yet-unobserved Higgs boson. A SM Higgs boson with 
mass (M H ) below 1 14.4 GeV/c 2 or with M H between 162 
and 166GeV/c 2 has been excluded at 95% confidence 
level in direct searches at LEP pQ and the Tevatron [2] . A 
recently reported [5] preliminary update of the Tevatron 
result has expanded this 95% exclusion range to values 
of M H between 158 and 175GeV/c 2 . 

At the Tevatron, production of the Higgs boson is 
dominated by the direct production process gg — > H, 
and for Mh < 135 GeV/c 2 the Higgs boson decays pri- 
marily to a pair of b quarks, H — > bb [4]. The process 
gg — > H — > bb is overwhelmed by multi-jet background. 
Associated production of a Higgs boson with a leptoni- 
cally decaying W or Z boson yields a distinct signature 
for efficient selection and in turn greater sensitivity de- 
spite a significantly smaller cross section than direct pro- 
duction [6] . This Letter presents an improved search 
for the SM Higgs boson produced in association with a 
Z boson, ZH — » £ + £~bb (where I is either an electron, 
e, or muon, fi) using 1.96 TeV pp collision data corre- 
sponding to 4.1 fb -1 of integrated luminosity collected 
with the CDF II detector [7] . This search channel is one 
of the most sensitive to a low-mass SM Higgs boson at 
the Tevatron 0(5]. 

Previous CDF efforts in this mode used an artificial 
neural network classifier (NN) [10] or a likelihood based 
on matrix-element probabilities (MEP) [TT] for signal iso- 
lation. Here we enhance these techniques with NN-based 
b jet discrimination [T2J H3] and an improved multivari- 
ate jet-energy correction to improve background rejec- 
tion. New Z — > e + e~ selections increase the acceptance 
of ZH signal, and new combinations of b jet identifiers 
yield better signal sensitivity, as reflected in the expected 
cross section limit. These additions improve the expected 
sensitivity by a factor of 1.2 over the gain expected just 
from additional integrated luminosity. 

The selection requirements most relevant to this anal- 
ysis are discussed below; a detailed presentation can be 
found in [13] . We select ZH candidates by first identi- 
fying a sample of events containing a Z — > £ + £~ decay. 
Events are selected in real time (triggered) based on the 
presence of high-p^ electron and muon [15] candidates. 
The majority (~ 80%) of ZH candidates pass the trigger 
selection requiring events to contain at least one central 
(M < 1-0) track of pr > 9GeV/c matched to an electro- 
magnetic energy (EM) cluster of Et > 18 GeV (a trigger 
electron) or at least one central track of pr > 20GeV/c 
pointing to signals in the muon detectors (a trigger 
muon). The remaining fraction of ZH candidate events 
comes from newly included data selected by a trigger that 
requires two or more EM clusters of Et > 18 GeV and 
\r)\ < 3.6 without requiring that the clusters are associ- 
ated with tracks (trackless trigger). Events are further 
required to contain a lepton pair that forms a Z candidate 
with mass in the range 76 < M« < 106 GeV/c 2 . Pairs of 



central leptons forming Z candidates must have opposite 
charge; electrons in the forward (\r}\ > 1.0) acceptance 
of the detector may not have an associated track and no 
charge requirement is imposed. 

We divide the Z candidates into two categories based 
on signal-to-background ratio (S/B). The search for the 
signal in these two categories is conducted separately to 
improve sensitivity to a ZH signal. The high-S/B cate- 
gory includes Z candidates formed from a trigger muon 
and a second muon candidate with px > lOGeV/c, or a 
trigger electron paired with a second electron candidate 
formed from either a central EM cluster of Et > 10 GeV 
matched to a track of pt > 5GeV/c or a forward EM 
cluster of Et > 18 GeV. The low-S/B category con- 
tains Z candidates in events satisfying the trackless trig- 
ger only or formed from a trigger electron paired with an 
isolated central track with pt > 20 GeV/c pointing to an 
uninstrumcntcd region of the calorimeter. The low-S/B 
category is included for the first time in the search for 
ZH production at CDF. 

Higgs boson candidates are assembled from pairs of 
jets. We form jets from energy deposits in the calorimeter 
using the jetclu algorithm [TB] with a cone radius of 0.4. 
We consider only jets well contained in the calorimeter, 
\r)\ < 2.0, and well separated from the Z-decay leptons, 
AR > 0.4 [TT]- Measured jet energies are corrected to 
account for ^-dependent variations in detector response, 
calorimeter coverage, and the expected contribution from 
additional pp pair interactions in the same event [18j . 
Events are required to have one jet with Et > 25 GeV 
and a second of Et > 15 GeV. We refer to the events 
containing a Z boson candidate and two such jets as the 
PreTag sample; b quark identification (described below) 
is applied to the PreTag sample to form our final analysis 
samples. There are 11,806 (3,061) events in the high 
(low) S/B PreTag data sample, wherein we expect 5.0 ± 
0.7 (0.8 ± 0.1) ZH signal events for M H = 115GeV/c 2 . 
Inclusion of the low-S/B subsample increases the total 
signal yield in the PreTag sample by 16%. 

The PreTag sample consists mainly of Z+light flavor 
(l.f.) jet (u ; d,s,g) events, with smaller contributions from 
Z+heavy flavor (h.f.) jet (c,b), it, and diboson processes. 
To reduce the Z+l.i. background, we look for b jets in the 
event. We use two algorithms to identify (tag) b jets: one 
based on evidence for a decay displaced spatially from the 
pp interaction point (SV) [7] and one based on track im- 
pact parameters with respect to the pp interaction point 
(JP) [IH]. For the SV algorithm, there are two operating 
points: tight and loose 20J . The tight operating point 
has better l.f. -jet rejection (smaller mistag probability) 
at the expense of reduced &-jet identification efficiency. 

We select events in the PreTag sample using the b tag- 
ging algorithms on the jet pairs forming Higgs candi- 
dates. We require the jet pairs to satisfy one of the fol- 
lowing classifications, in order of precedence from high- 
est to lowest ratio of ZH signal to background: a pair 
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containing two SV-tight-tagged jets is classified as tight- 
double-tagged (TDT) ; a pair consisting of one SV-loose- 
tagged jet and a second JP-tagged jet is classified as 
loose-double-tagged (LDT); and a pair where only one 
jet has a SV-tight-tag is classified as single-tagged (ST). 
When there arc multiple H — > bb candidates in an event, 
the jet pair with the highest b tag class is selected (TDT 
> LDT > ST). This is the first time the JP algorithm 
has been used in the ZH — > £ + £~bb channel. While the 
b tag selection matches the signal selection efficiency (ap- 
proximately 60%) and background rejection rate (96%) 
of previous efforts, the addition of the new LDT class in- 
creases sensitivity to a ZH signal by 6%. Due to differing 
background composition, the events are divided into in- 
dependent subsamples based on 6-tagging class. With 
two Z boson S/B categories and three 6-tagging classes, 
we form a total of six subsamples that we analyze for ZH 
content. 

We compare the 6-tag data to a model of signal and 
backgrounds to estimate the signal content. Signal, ti, 
and diboson events are modeled with the pythia [21] 
event generator. Backgrounds from Z+h.i. processes 
are simulated at the quark level using ALPGEN [22] , then 
passed to pythia for hadronization. The detector re- 
sponse is modeled with a detailed detector simulation [7J . 
We estimate the background from Z+\l. mistags us- 
ing re-weighted PreTag data with weights reflecting the 
probability for a l.f. jet to be erroneously b tagged. 
A small fraction, much less than 1%, of jets can be 
erroneously identified as electrons, resulting in a back- 
ground of misidentified (mislD) Z — > ee candidates. The 
likelihood of jet-electron misidentification is measured in 
generic jet data and applied to all electron-jet and jet- 
jet pairings in the electron triggered samples to generate 
a model of mislD Z events. The mislD Z fx/j, back- 
ground contribution is estimated from the events pass- 
ing all selection requirements but containing like-charge 
muon pairs. Predicted and observed event totals are 
listed in Table [IJ for the 6-tagged subsamples. 

To improve the separation of ZH from background, 
we utilize several multivariate techniques that use kine- 
matic quantities as inputs. The dijet mass Mjj is one 
of the most useful quantities, with its separating power 
limited mainly by the jet-energy resolution. In ZH sig- 
nal events with Z —> l + £~ , incorrect measurement of 
jet energies results in apparent missing transverse energy 
[21]. We correct jet energies, based upon the J?t, 
and thereby improve the resolution on the dijet invariant 
mass. Jet-energy correction factors are computed by a 
NN trained to match measured jet energies to parton- 
lcvel energies in Z+jets and signal events. In this anal- 
ysis, this neural network is improved compared to the 
previous analysis [10] by utilizing additional input vari- 
ables describing the recoil of the Z boson. The corrected 
jet energies are used to recompute the Higgs candidate 
mass Mh, the pr of the jets, the pr of the Higgs candi- 




FIG. 1: The dijet invariant mass distribution of the two jets 
with the highest Et in the PreTag sample. The distribution 
is shown for data after NN correction of jet energies. The dijet 
mass is shown for background before (dash-dotted line) and 
after (grey region) NN correction. The dotted (solid) curves 
show the dijet mass for signal {Mh = 115 GeV/c 2 and scaled 
by a factor of 1500) before (after) correction. 



date, the projection of J£t onto the lower-£/r Higgs jet, 
and the sphericity. The effect of the NN corrections on 
the reconstructed H — > bb mass is shown in Fig. [Tj In 
signal the resolution [24] on Mh is improved from about 
18% to 12%. 

To exploit the combined signal-to-background discrim- 
ination power of event quantities and their correlations, 
we employ a neural network discriminant (NN^) trained 
to simultaneously separate ZH, ti, and Z + jets events. 
The NNd is configured to return values of (x, y) = (1, 0) 
for ZH events, (0, 0) for Z+jets, and (1, 1) for ti. A sep- 
arate NN^> is formed for each of the three 6-tag classes. 
In addition to the quantities recomputed with corrected 
jet energies, the NNd inputs include: $t] MEPs for ZH, 
tt, and Z+jets processes; the number of jets in the event; 
and the output of a & jet identifying artificial neural net- 
work (NNt) [12]. The ZH process MEP is computed [UJ 
by convolving the theoretical matrix element for ZH pro- 
duction with detector resolution functions. The resulting 
MEP reflects the likelihood that an event is signal. We 
calculate two additional MEPs under the hypotheses that 
an event is Z+jets or tt to aid background rejection. The 
NN& augments the performance of the SV algorithm by 
isolating incorrectly ^-tagged l.f. jets. The addition of 
NNf, as an input enhances the ability of the NN^> to dis- 
tinguish ZH from Z+l.L, which constitutes 40% of the 
total background in the ST class. One-dimensional pro- 
jections of NNd output are shown in Fig.[2]for each &-tag 
class. 

The effect of systematic uncertainties on the determi- 
nation of signal content requires propagating uncertain- 
ties on NNx) input quantities to the output distributions. 
The dominant effect on the result comes from uncertain- 
ties on cross sections for background processes — a 40% 
uncertainty is assumed on the normalization of Z + h.f . 
samples [25], 11.5% for the diboson samples [26 , 20% 
to account for the theoretical and experimental uncer- 
tainty on the ti cross section [27], and 5% for ZH sig- 
nal [28]. Uncertainties of 4% (ST), 8% (TDT), and 11% 
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TABLE I: Comparison of the expected mean event totals for background and ZH signal with the observed number of 
data events. The totals are for full event selection, and systematic and statistical uncertainties are combined in quadrature. 
Systematic uncertainties dominate. The background composition is provided in the first five rows. Each of the six ZH 
subsamples is presented in a separate column. 





High S/B 






Low S/B 




Process 


TDT LDT 


ST 


TDT 


LDT 


ST 


tt 


7.0 ±1.5 8±2 


17 ±4 


2.9 ±0.6 


3.2 ±0.8 


8.9 ± 1.9 


Diboson 


2.9 ±0.4 4±1 


16 ±2 


0.5 ±0.1 


0.6±0.1 


3.3 ±0.5 


Z + hi. 


18 ± 7 30 ±13 


159 ± 67 


3.5 ± 1.5 


5.6± 2.4 


32 ± 14 


z + u. 


0.9 ±0.3 9±3 


152 ±23 


0.4 ± 0.1 


3.8 ± 1.3 


50 ± 7.6 


misID Z 


0.7±0.3 2±1 


22 ± 11 


1.4 ±0.7 


1.1 ±0.5 


23 ± 12 


Total Bkg. 


29 ± 8 53 ± 14 


366 ± 72 


9± 2 


14 ±3 


117 ± 20 


Z#(115GeV/c 2 ) 0.7 ±0.1 0.7 ±0.1 1.7 ±0.2 


0.11 ±0.01 


0.11 ±0.03 0.28 ±0.05 


Data 


23 56 


406 


12 


14 


116 




Projection of NN Output Projection of NN Output Projection of NN Output 



FIG. 2: Projections of the two-dimensional neural network (NNd) output onto the z-axis (x and y are defined in the text) 
for events in the 6-tag categories ST, LDT, and TDT. Events with a NNd score of y > 0.1 are omitted to highlight the signal 
region. The ZH contribution is shown, multiplied by a factor of 25, for Mh = 115GeV/c 2 . 



(LDT) on the normalization of 6-tagged samples are ap- 
plied to account for different 6-tag efficiencies in data and 
simulation. Other uncertainties affecting sample normal- 
izations include: 6% on the integrated luminosity, 1% 
on the trigger and lepton reconstruction efficiencies [53], 
1.5% on the measurement of lepton energies, and a 50% 
uncertainty on the total lepton misID estimate. We in- 
clude uncertainties on jet energies, the modeling of initial 
and final state radiation, and mistag event weighting as 
variations on the shape and normalization of the NN^ 
output. 

We calculate a limit on the associated production of a 
SM Higgs boson and Z boson based on the comparison of 
the full NN o output of the 6-tagged data to expectations 
for signal and background. We consider eleven Higgs 
mass hypotheses between 100 and 150 GeV/c 2 . We use a 
Bayesian algorithm |30j with a flat prior in the produc- 
tion cross section, integrating over the priors for the sys- 
tematic uncertainties, incorporating correlated rate and 
shape uncertainties, and uncorrelated bin-by-bin statis- 
tical uncertainties [3T]. Systematic uncertainties reduce 
the sensitivity of this search by 16%. The expected 95% 
credibility level (C.L.) limits are calculated assuming no 
signal, based on expected backgrounds only, as the me- 
dian of 1000 simulated experiments. The ±1ct and ±2<r 
expected limits are derived from the distribution of the 



Expected 

— Observed 
■ ±lo 








o 





















1 100 110 120 130 140 150 



M H (GeV/c 2 ) 

FIG. 3: The expected (dashed curve) and observed (solid 
curve) ZH cross section upper limits divided by the SM cross 
section are shown as a function of the Higgs boson mass. The 
dark (light) band represents the ±1<t (±2cr) expected limit 
range. 



simulation limits at the 16 th , 84 th , 2 nd , and 98 th per- 
centiles of the distribution, respectively. The observed 
95% C.L. on the ZH cross section are calculated from 
the 6-tagged data, and displayed in Fig. [3] and summa- 
rized in Table |TTJ 

In conclusion, we have searched for the SM Higgs boson 
produced in association with a Z boson, where Z —> l + 1~ 
and H — ¥ bb, finding no significant evidence for the pro- 
cess. We set 95% C.L. upper limits on the ZH produc- 
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TABLE II: Expected (Exp.) and observed (Obs.) 95% C.L. 
upper limits on the ZH production cross section times the 
branching ratio for H — > bb normalized to the SM value for 
Higgs masses between 100 and 150 GeV/c 2 . The assumed ZH 
cross section and branching fraction for H — > bb are 0.11 pb [5] 
E] and 0.73 g] for a 115GeV/c 2 Higgs boson. 



M H 


100 


105 


110 


115 


120 


125 


130 


135 


140 


145 


150 


Exp. 


6.7 


6.4 


6.3 


6.8 


8.5 


10. 


13 


19 


29 


45 


74 


Obs. 


4.5 


4.6 


5.3 


5.9 


7.9 


8.1 


10. 


14 


19 


24 


43 



tion cross section times the H — > bb branching ratio for 
Higgs boson masses between 100 and 150GeV/c 2 . For a 
Higgs boson mass of 115GeV/c 2 we set (expect) a 95% 
C.L. upper limit of 5.9 (6.8) times the standard model 
prediction. This result is an important step forward in 
the search for the Higgs boson and the source of EWSB, 
improving upon the previous observed (expected) limits 
in this channel by factors of 2.2 to 3.7 (1.9 to 2.4). 
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